Iranian Journal of Numerical Analysis and Optimization 
Vol. 6, No. 1, (2016), pp 101-119 


A nonmonotone trust-region-approach 
with nonmonotone adaptive radius for 
solving nonlinear systems 
Abstract


This paper presents a trust-region procedure for solving systems of
nonlinear equations. The proposed approach combines an effective adaptive
trust-region radius with a nonmonotone strategy. Selecting an adaptive
radius based on a suitable nonmonotone strategy is expected to improve the
efficiency and robustness of the trust-region framework and to decrease the
computational cost of the algorithm by reducing the number of subproblems
that must be solved. Global convergence to first-order stationary points
and local q-quadratic convergence of the proposed approach are proved.
Numerical experiments show that the new algorithm is promising and
attractive for solving nonlinear systems.


Keywords: Nonlinear equations; Trust-region framework; Adaptive radius; 
Nonmonotone technique. 


1 Introduction 


In this paper, we consider the nonlinear system of equations 
$$F(x) = 0, \quad x \in \mathbb{R}^n, \tag{1}$$

where $F : \mathbb{R}^n \to \mathbb{R}^n$ is a continuously differentiable
mapping of the form $F(x) := (F_1(x), F_2(x), \dots, F_n(x))^T$. If $F(x)$
has a zero, then the nonlinear system (1) is equivalent to the following
nonlinear unconstrained least-squares problem

$$\min f(x) := \tfrac{1}{2}\|F(x)\|^2 \quad \text{s.t. } x \in \mathbb{R}^n, \tag{2}$$
where $\|\cdot\|$ denotes the Euclidean norm. Trust-region frameworks for
solving the system of nonlinear equations (1) are a popular class of
iterative procedures that, in each iteration, generate a trial step $d_k$ by
computing an exact or approximate solution of the following subproblem

$$\min\; m_k(x_k + d) := \tfrac{1}{2}\|F_k + J_k d\|^2 = f_k + d^T J_k^T F_k + \tfrac{1}{2} d^T J_k^T J_k d \quad \text{s.t. } d \in \mathbb{R}^n \text{ and } \|d\| \le \Delta_k, \tag{3}$$

where $f_k := f(x_k)$, $F_k := F(x_k)$, $J_k := F'(x_k)$ is the Jacobian of
$F(x)$ at $x_k$, and $\Delta_k > 0$ is the trust-region radius. The ratio

$$r_k := \frac{f_k - f(x_k + d_k)}{m_k(x_k) - m_k(x_k + d_k)} \tag{4}$$


plays a main role in trust-region frameworks. The model matches the
original problem well at the current iterate $x_k$ whenever $r_k$ is
sufficiently close to 1. The agreement is weak whenever $r_k$ is near zero,
and there is no agreement when $r_k$ is negative. Generally, if $r_k$ is
greater than a positive constant $\mu_1$, the trial step $d_k$ is accepted,
giving $x_{k+1} := x_k + d_k$. In this case, it is safe to increase the
trust-region radius in the next iteration. Otherwise, the trust-region
radius is shrunk and the subproblem (3) is solved again to find an
acceptable trial point.
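As an illustration of the model (3) and the ratio (4), the following minimal sketch evaluates both for a full Gauss-Newton step. The 2x2 test system and the helper names `merit`, `model` and `ratio` are assumptions made for this example and do not come from the paper.

```python
import numpy as np

def merit(F, x):
    """f(x) = 0.5 * ||F(x)||^2, the least-squares merit function in (2)."""
    Fx = F(x)
    return 0.5 * float(Fx @ Fx)

def model(Fk, Jk, d):
    """m_k(x_k + d) = 0.5 * ||F_k + J_k d||^2, the quadratic model in (3)."""
    r = Fk + Jk @ d
    return 0.5 * float(r @ r)

def ratio(F, x, Fk, Jk, d):
    """r_k in (4): actual reduction divided by predicted reduction."""
    actual = merit(F, x) - merit(F, x + d)
    predicted = model(Fk, Jk, np.zeros_like(d)) - model(Fk, Jk, d)
    return actual / predicted

# Hypothetical test system (not from the paper):
# F(x) = (x0^2 + x1^2 - 1, x0 - x1)
F = lambda x: np.array([x[0] ** 2 + x[1] ** 2 - 1.0, x[0] - x[1]])
J = lambda x: np.array([[2.0 * x[0], 2.0 * x[1]], [1.0, -1.0]])

x = np.array([1.0, 0.5])
d = np.linalg.solve(J(x), -F(x))   # full Gauss-Newton step
r = ratio(F, x, F(x), J(x), d)     # close to 1: the model predicts f well here
```

A value of the ratio near 1, as in this example, is the situation in which the trial step would be accepted and the radius kept or enlarged.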

It is known that the traditional trust-region framework has some
drawbacks: (a) a very small trust-region radius $\Delta_k$ increases the
total number of iterates; (b) a very large trust-region radius $\Delta_k$
increases the total number of subproblems to be solved; (c) the ratio (4)
alone does not ensure agreement between the quadratic model and the
objective function, which increases the computational cost. Using an
adaptive radius is an appropriate way to overcome drawbacks (a) and (b).
Many researchers have investigated how to find the best trust-region radius
$\Delta_k$, but no general rule for generating the radius has been
established. Therefore, in order to decrease the total number of
subproblems solved for an arbitrary problem, some adaptive processes for
determining the radius have been proposed, see [3,33,45]. For example, Zhang
and Wang [44] proposed the adaptive radius


$$\Delta_k := c^{p_k}\,\|F_k\|^{\delta},$$


where $0 < c < 1$ and $0.5 < \delta < 1$ are constants and $p_k$ is a
nonnegative integer starting from zero. The major advantage of this method
is that the radius does not stay very large, which makes it possible to
avoid re-solving the trust-region subproblem. This proposal has some
disadvantages. Firstly, the sequence generated by this method is only
superlinearly convergent, with convergence order $2\delta$. Secondly, the
efficiency of the numerical results depends strongly on the choice of
$\delta$. Finally, this method cannot adequately prevent a very small
trust-region radius. To overcome these drawbacks, Fan and Pan [15]
suggested


$$\Delta_k := c^{p_k} M \|F_k\|,$$


which is another satisfactory radius, with a constant $M$, an integer $p_k$
and $c \in (0,1)$. This choice of the trust-region radius plays an important
role in proving quadratic convergence and, to some extent, prevents a very
small trust-region radius. In this method, if $\|F_k\|$ is very small, then
the constant $M$ must be chosen large enough that the radius is not too
small. But for problems in which $\|F_k\|$ is large, $M\|F_k\|$ will be very
large and the number of subproblems solved may increase. Thus, the amount of
computation and the cost of solving the problem will increase.
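The trade-off between the two radii can be seen numerically. The sketch below evaluates both formulas for a small and a large residual norm; the parameter values `c`, `delta` and `M` are illustrative assumptions, not the values used in [44] or [15].

```python
def radius_zhang_wang(norm_F, p, c=0.5, delta=0.75):
    """Radius c^p * ||F_k||^delta with 0.5 < delta < 1 (values assumed)."""
    return c ** p * norm_F ** delta

def radius_fan_pan(norm_F, p, c=0.5, M=10.0):
    """Radius c^p * M * ||F_k|| (M = 10 is an assumed value)."""
    return c ** p * M * norm_F

# Small residual: both radii become small, the linear one much smaller.
small = (radius_zhang_wang(1e-6, 0), radius_fan_pan(1e-6, 0))
# Large residual: M * ||F_k|| becomes very large.
large = (radius_zhang_wang(1e3, 0), radius_fan_pan(1e3, 0))
```

With these assumed values, the second radius is about 1e-5 for the small residual and 1e4 for the large one, which mirrors the two situations described in the text.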


One convenient way to overcome drawback (c) is to use nonmonotone
techniques, which can improve the efficiency and robustness of trust-region
algorithms, especially when they are applied to highly nonlinear problems
in the presence of a narrow curved valley; see, for example,
[1,2,4,18-20,43,46]. Therefore, a nonmonotone strategy can be employed to
increase the efficiency of the proposed procedures.


Contribution. The primary goal in the design of the new method is to
decrease the computational cost by combining two techniques: a nonmonotone
strategy and an adaptive trust-region radius. We expect that combining these
two techniques improves the numerical performance and efficiency of the
algorithm. We attain this goal by building a new adaptive radius based on a
nonmonotone technique. Global convergence to first-order critical points and
q-quadratic convergence are established. The numerical experiments confirm
the efficiency and robustness of the proposed method for solving systems of
nonlinear equations.


Organization. This paper is organized as follows. In Section 2, a new
adaptive trust-region radius and a nonmonotone technique are proposed, and
the structure of the algorithm is described. In Section 3, we investigate
the global convergence and the quadratic convergence rate of the new
algorithm under some suitable assumptions. Numerical results are reported
in Section 4. Finally, we conclude the paper with some remarks in
Section 5.


2 Motivation and algorithmic structure


It is well known that traditional approaches in unconstrained optimization
generally use a globalization technique, such as line search or trust
region, to guarantee the global convergence of the algorithm. These
globalization techniques mostly enforce monotonicity of the sequence of
objective function values, which usually produces short steps and hence
slow numerical convergence on highly nonlinear problems in the presence of
a narrow curved valley, see [1,2,4,9,18,43]. For example, the traditional
trust-region framework exploits the ratio (4), which leads to


$$f_k - f_{k+1} \ge \mu_1 \bigl(m_k(x_k) - m_k(x_k + d_k)\bigr) \ge 0.$$


This condition clearly implies that the sequence $\{f_k\}$ is monotone.
To avoid this drawback in Armijo-type line search globalization techniques,
Grippo, Lampariello and Lucidi [18] introduced a nonmonotone strategy for
unconstrained optimization problems in which the Armijo condition is
replaced by the condition


$$f(x_k + \alpha_k d_k) \le f_{l(k)} + \delta\,\alpha_k\, g_k^T d_k, \tag{5}$$

where $\delta \in (0,1)$, $g_k := \nabla f(x_k)$ is the gradient of $f(x)$
at $x_k$, and

$$f_{l(k)} := \max_{0 \le j \le m(k)} \{f_{k-j}\}, \quad k \in \mathbb{N} \cup \{0\}, \tag{6}$$

in which $m(0) := 0$ and $0 \le m(k) \le \min\{m(k-1)+1, N\}$ with
$N \ge 0$. The theoretical and numerical results have shown that this
technique improves both the possibility of finding the global optimum and
the rate of convergence of algorithms. Motivated by this work, Deng et al.
[9] modified the ratio (4), which assesses the agreement between the
quadratic model and the objective function in trust-region methods, and
thereby introduced the first nonmonotone trust-region algorithm. This idea
was developed further by Zhou and Xiao [38,46], Xiao and Chu [37] and Toint
[35,36]. The most common nonmonotone ratio is defined as follows:


$$\hat r_k := \frac{f_{l(k)} - f(x_k + d_k)}{m_k(x_k) - m_k(x_k + d_k)}. \tag{7}$$
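The effect of replacing $f_k$ by $f_{l(k)}$ can be sketched with a short example. The function names and the history of function values below are hypothetical, and the memory update `min(m + 1, N)` is just one admissible choice under the bound stated for $m(k)$.

```python
def window_max(values, k, m_k):
    """max over j = 0..m_k of values[k - j], as in (6)."""
    return max(values[k - m_k : k + 1])

def next_memory(m_prev, N):
    """One admissible update: m(k) = min(m(k-1) + 1, N)."""
    return min(m_prev + 1, N)

def reduction_ratio(f_ref, f_trial, predicted):
    """(f_ref - f_trial) / predicted; f_ref = f_k gives (4), f_ref = f_l(k) gives (7)."""
    return (f_ref - f_trial) / predicted

# Made-up history f_0..f_3 and a trial value slightly above f_3.
f_hist = [10.0, 4.0, 6.0, 5.0]
N = 2
m = 0
for _ in range(1, len(f_hist)):
    m = next_memory(m, N)

k = len(f_hist) - 1
f_ref = window_max(f_hist, k, m)                      # max of f_1, f_2, f_3
monotone = reduction_ratio(f_hist[k], 5.5, 1.0)       # negative: step rejected
nonmonotone = reduction_ratio(f_ref, 5.5, 1.0)        # positive: step may be accepted
```

In this made-up case the trial point increases $f$ relative to $f_k$, so the monotone ratio is negative, while the nonmonotone ratio is positive because the reference value is the largest of the last three function values.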


To overcome disadvantages (a) and (b), following the method proposed by
Esmaeili and Kimiaei [11], we define the new adaptive radius by

$$\Delta_k := c^{p_k}\, NF_{l(k)}, \tag{8}$$

in which $0 < c < 1$, $p_k$ is the smallest nonnegative integer $p$ such
that $\hat r_k \ge \mu$, and

$$NF_{l(k)} := \max_{0 \le j \le m(k)} \{\|F_{k-j}\|\}, \quad k \in \mathbb{N} \cup \{0\}, \tag{9}$$

in which $m(0) := 0$ and $0 \le m(k) \le \min\{m(k-1)+1, N\}$ with $N \ge 0$.
The proposed adaptive trust-region radius has some benefits. First, since
the sequence $\{NF_{l(k)}\}$ decreases slowly and bounds the sequence
$\{\|F_k\|\}$ from above (see the proof of Lemma 3), it avoids very small
trust-region radii as far as possible and thus prevents an increase in the
total number of iterates. Second, because the sequence $\{NF_{l(k)}\}$ is
decreasing, $\Delta_k$ will not stay too large, which prevents an increase
in the number of subproblems solved. Hence, by controlling the trust-region
radius, the new method can avoid overly large trial steps near the optimizer
and overly small trial steps far from the optimizer.
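A small sketch of how the radius (8) could be tracked in practice is given below. The class name `NonmonotoneRadius`, the use of a bounded `deque`, the choice $m(k) = \min\{k, N\}$ and the sample residual norms are assumptions for illustration only.

```python
from collections import deque

class NonmonotoneRadius:
    """Tracks NF_l(k) from (9) and returns the radius c^p * NF_l(k) from (8)."""

    def __init__(self, N, c=0.5):
        # With m(k) = min(k, N) the window holds at most N + 1 residual norms.
        self.norms = deque(maxlen=N + 1)
        self.c = c

    def push(self, norm_F):
        """Record ||F_k|| for the newest iterate."""
        self.norms.append(norm_F)

    def nf(self):
        """NF_l(k): largest residual norm in the current window."""
        return max(self.norms)

    def radius(self, p):
        """Radius after p shrinking steps in the inner cycle."""
        return self.c ** p * self.nf()

tracker = NonmonotoneRadius(N=2, c=0.5)
for norm_F in [8.0, 3.0, 1.0, 0.5]:   # made-up residual norms
    tracker.push(norm_F)
```

After the four pushes the window holds the last three norms, so the reference value is 3.0 rather than the current norm 0.5, and the radius is correspondingly larger than one based on $\|F_k\|$ alone.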

The proposed approach is analyzed under the following assumptions:


(H1) The level set $L(x_0) := \{x \in \mathbb{R}^n \mid f(x) \le f(x_0)\}$
is bounded for any given $x_0 \in \mathbb{R}^n$, and $F(x)$ is continuously
differentiable on a compact convex set $\Omega$ containing the level set
$L(x_0)$.


(H2) The sequence of matrices $\{J_k\}$ is bounded and uniformly nonsingular
on $\Omega$, i.e., there exist constants $0 < M_0 < 1 < M_1$ such that

$$\|J_k\| \le M_1 \quad \text{and} \quad M_0\|F_k\| \le \|J_k^T F_k\|, \quad \forall k \in \mathbb{N} \cup \{0\}. \tag{10}$$


(H3) The decrease in the model $m_k$ is at least a fraction of that
obtained by the Cauchy point, i.e., there exists a constant
$\beta \in (0,1)$ such that

$$m_k(x_k) - m_k(x_k + d_k) \ge \beta\,\|J_k^T F_k\| \min\left\{\Delta_k, \frac{\|J_k^T F_k\|}{\|J_k^T J_k\|}\right\} \tag{11}$$

for all $k \in \mathbb{N} \cup \{0\}$.

(H4) $J(x)$ is Lipschitz continuous in $L(x_0)$, with Lipschitz constant
$\gamma_L$.


We now incorporate both the nonmonotone term and the adaptive radius into
the trust-region framework and outline the resulting algorithm:

Algorithm 1 NATR (Nonmonotone Adaptive Trust-Region Algorithm)
Input: An initial point $x_0 \in \mathbb{R}^n$, $c, \mu \in (0,1)$, $N \ge 0$, $\epsilon > 0$ and $k_{\max}$.
Output: $x_b$, $f_b$;

 1 Begin
 2   $\Delta_0 := \|F_0\|$; $f_{l(0)} := \tfrac{1}{2}\|F_0\|^2$; $NF_{l(0)} := \|F_0\|$; $m(0) := 0$; $k := 0$;
 3   While $\|F_k\| > \epsilon$ && $k \le k_{\max}$ do
 4     $p := 0$; $\hat r_k := 0$;
 5     While $\hat r_k < \mu$ do
 6       specify the trial step $d_k$ by solving the subproblem (3);
 7       compute $F(x_k + d_k)$;
 8       $f(x_k + d_k) := \tfrac{1}{2}\|F(x_k + d_k)\|^2$;
 9       determine $\hat r_k$ using (7);
10       If $\hat r_k < \mu$ then
11         $p \leftarrow p + 1$;
12         determine $\Delta_k$ using (8);
13       end
14     end
15     $x_{k+1} := x_k + d_k$; $F_{k+1} := F(x_{k+1})$; $f_{k+1} := f(x_{k+1})$; $J_{k+1} := J(x_{k+1})$;
16     compute $J_{k+1}$ and let $m(k+1) := \min\{m(k) + 1, N\}$;
17     calculate $NF_{l(k+1)}$ by (9) and set $f_{l(k+1)} := \tfrac{1}{2} NF_{l(k+1)}^2$;
18     select $\Delta_{k+1} := NF_{l(k+1)}$;
19     $k \leftarrow k + 1$;
20   end
21   $x_b := x_k$; $f_b := f_k$;
22 end


In Algorithm 1, the cycle from Line 3 to Line 20 is called the outer
cycle, and the cycle from Line 5 to Line 14 is called the inner cycle.
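The outer and inner cycles can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the authors' implementation: the subproblem solver `trial_step` swaps in a plain "Gauss-Newton step if it fits, otherwise Cauchy step" rule instead of the solver used in the experiments, the default parameter values and the cap `max_inner` are hypothetical, and the 2x2 test system is made up.

```python
import numpy as np

def trial_step(Fk, Jk, delta):
    """Assumed stand-in solver for (3): Gauss-Newton step if feasible, else Cauchy step."""
    g = Jk.T @ Fk
    try:
        d = np.linalg.solve(Jk, -Fk)
        if np.linalg.norm(d) <= delta:
            return d
    except np.linalg.LinAlgError:
        pass
    Jg = Jk @ g
    tau = min(delta / np.linalg.norm(g), (g @ g) / (Jg @ Jg))
    return -tau * g

def natr(F, J, x0, c=0.5, mu=1e-4, N=5, eps=1e-8, kmax=200, max_inner=60):
    x = np.asarray(x0, dtype=float)
    Fk, Jk = F(x), J(x)
    norms = [np.linalg.norm(Fk)]           # history of ||F_k||
    m, k = 0, 0
    while norms[-1] > eps and k < kmax:    # outer cycle
        nf = max(norms[-(m + 1):])         # NF_l(k), see (9)
        f_ref = 0.5 * nf ** 2              # f_l(k)
        for p in range(max_inner):         # inner cycle (cap is a safeguard, not in the paper)
            delta = c ** p * nf            # adaptive radius (8)
            d = trial_step(Fk, Jk, delta)
            F_trial = F(x + d)
            pred = 0.5 * (Fk @ Fk) - 0.5 * np.sum((Fk + Jk @ d) ** 2)
            r_hat = (f_ref - 0.5 * (F_trial @ F_trial)) / pred   # ratio (7)
            if r_hat >= mu:
                break
        else:
            raise RuntimeError("inner cycle did not terminate")
        x, Fk = x + d, F_trial
        Jk = J(x)
        norms.append(np.linalg.norm(Fk))
        m = min(m + 1, N)
        k += 1
    return x, 0.5 * float(Fk @ Fk), k

# Hypothetical test system with solution (sqrt(1/2), sqrt(1/2)).
F = lambda x: np.array([x[0] ** 2 + x[1] ** 2 - 1.0, x[0] - x[1]])
J = lambda x: np.array([[2.0 * x[0], 2.0 * x[1]], [1.0, -1.0]])
x_b, f_b, iters = natr(F, J, [1.0, 0.5])
```

On this small example every trial step is accepted with $p = 0$, so each outer iteration solves the subproblem only once, which is the behaviour the adaptive radius is designed to encourage.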


Remark 1. The inequality (11) is called the sufficient reduction condition,
see [32]. It has been investigated by many authors when they extended
inexact methods for solving the subproblem (3), for example see
[11, 13-15]. For the purpose of global convergence, it is enough to find a
vector $d_k$ that gives a sufficient reduction in the quadratic model
$m_k$. Well-known convergence results [31] require the trial step $d_k$ to
give a reduction in the model $m_k$ that is at least some fixed multiple of
the decrease attained by the Cauchy step at each iteration.
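The sufficient reduction condition can be checked numerically for the Cauchy step itself. In the sketch below, the helper names, the test data and the choice $\beta = 1/2$ with the spectral norm of $J_k^T J_k$ are assumptions; the bound with $\beta = 1/2$ is the classical estimate for the Cauchy point of a convex quadratic model.

```python
import numpy as np

def cauchy_step(Fk, Jk, delta):
    """Minimizer of the model (3) along -J_k^T F_k inside the trust region."""
    g = Jk.T @ Fk                       # gradient of f at x_k
    Jg = Jk @ g
    t_unc = (g @ g) / (Jg @ Jg)         # unconstrained minimizer along -g
    t = min(t_unc, delta / np.linalg.norm(g))
    return -t * g

def predicted_reduction(Fk, Jk, d):
    """m_k(x_k) - m_k(x_k + d)."""
    return 0.5 * (Fk @ Fk) - 0.5 * np.sum((Fk + Jk @ d) ** 2)

def cauchy_bound(Fk, Jk, delta, beta=0.5):
    """Right-hand side of (11), using the spectral norm of J_k^T J_k."""
    g_norm = np.linalg.norm(Jk.T @ Fk)
    return beta * g_norm * min(delta, g_norm / np.linalg.norm(Jk.T @ Jk, 2))

# Made-up data for illustration.
Jk = np.array([[2.0, 1.0], [1.0, -1.0]])
Fk = np.array([0.25, 0.5])
checks = []
for delta in [0.01, 0.1, 1.0, 10.0]:
    d = cauchy_step(Fk, Jk, delta)
    checks.append(predicted_reduction(Fk, Jk, d) >= cauchy_bound(Fk, Jk, delta))
```

Any solver whose step reduces the model at least as much as this Cauchy step therefore also satisfies (11) with the same constant.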


Lemma 1. Suppose that (H4) holds, the sequence $\{x_k\}$ is generated by
Algorithm 1 and $d_k$ is a solution of the subproblem (3) such that
$\|F(x_k) + J(x_k) d_k\| \le \|F(x_k)\|$. Then, we have

$$|f(x_k + d_k) - m_k(x_k + d_k)| \le O(\|d_k\|^2). \tag{12}$$


Proof. See [9]. 

The following lemma indicates that the inner cycle of Algorithm 1
terminates in a finite number of inner iterates.


Lemma 2. Suppose that (H2)-(H4) hold and the sequence $\{x_k\}$ is
generated by Algorithm 1. Then, the inner cycle of Algorithm 1 is
well-defined.


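Proof. Assume that the inner cycle of Algorithm 1 cycles infinitely at
iteration $k$, i.e., $\Delta_k^p := c^p NF_{l(k)} \to 0$ as $p \to \infty$;
equivalently, for any $\eta > 0$, we have $\Delta_k^p \le \eta$ for
sufficiently large $p$. Since $x_k$ is not the optimum of (2), there exists
a constant $\epsilon > 0$ such that $\|F_k\| \ge \epsilon$. Without loss of
generality, let $\eta := M_0\epsilon / M_1^2$. This fact along with (H2) and
(11) implies

$$\begin{aligned}
m_k(x_k) - m_k(x_k + d_k^p) &\ge \beta \|J_k^T F_k\| \min\left\{\Delta_k^p, \frac{\|J_k^T F_k\|}{\|J_k^T J_k\|}\right\} \\
&\ge \beta M_0 \|F_k\| \min\left\{\Delta_k^p, \frac{M_0 \|F_k\|}{M_1^2}\right\} \\
&\ge \beta M_0 \epsilon \min\{\Delta_k^p, \eta\} \\
&= \beta M_0 \epsilon \Delta_k^p,
\end{aligned} \tag{13}$$

where $d_k^p$ is a solution of the subproblem (3) corresponding to $p$ in
the $k$-th iterate. Now, Lemma 1 and (13) lead to

$$\begin{aligned}
\left|\frac{f_k - f(x_k + d_k^p)}{m_k(x_k) - m_k(x_k + d_k^p)} - 1\right|
&= \left|\frac{m_k(x_k + d_k^p) - f(x_k + d_k^p)}{m_k(x_k) - m_k(x_k + d_k^p)}\right| \\
&\le \frac{O(\|d_k^p\|^2)}{\beta M_0 \epsilon \Delta_k^p}
\le \frac{O((\Delta_k^p)^2)}{\beta M_0 \epsilon \Delta_k^p} \to 0, \quad \text{as } p \to \infty.
\end{aligned}$$

Therefore, there exists a sufficiently large $p_k$ such that

$$r_k = \frac{f_k - f(x_k + d_k^{p_k})}{m_k(x_k) - m_k(x_k + d_k^{p_k})} \ge \mu.$$

Besides, from the definition of $f_{l(k)}$, it is clear that
$f_{l(k)} \ge f_k$. This fact along with the previous inequality immediately
implies $\hat r_k \ge r_k \ge \mu$, which means that the inner cycle of
Algorithm 1 stops and so Algorithm 1 is well-defined.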


Lemma 3. Suppose that (H1) holds and the sequence $\{x_k\}$ is generated by
Algorithm 1. Then, for all $k \in \mathbb{N} \cup \{0\}$, we have
$x_k \in L(x_0)$ and the sequence $\{NF_{l(k)}\}$ is decreasing and
convergent.


Proof. Using the definition of $NF_{l(k)}$, we have

$$\|F_k\| \le NF_{l(k)}.$$

By induction, the result evidently holds for $k = 0$ because
$NF_{l(0)} = \|F_0\|$. Assuming $x_i \in L(x_0)$ for $i = 1, 2, \dots, k$, we
show that $x_{k+1} \in L(x_0)$ for all $k \in \mathbb{N}$. It can be seen
that

$$\frac{NF_{l(k)}^2}{2} - \frac{\|F_{k+1}\|^2}{2} = f_{l(k)} - f_{k+1} \ge \mu \bigl(m_k(x_k) - m_k(x_k + d_k)\bigr) \ge 0,$$

so

$$\|F_{k+1}\| \le NF_{l(k)} \le \|F_0\|.$$

Thus, the sequence $\{x_k\}$ is contained in $L(x_0)$. Next we prove that
the sequence $\{NF_{l(k)}\}$ is decreasing. We consider the following two
cases:

i) $k \ge N$. In this case, we have $m(k) = N$. So, the definition of
$NF_{l(k)}$ along with the fact that $\|F_{k+1}\| \le NF_{l(k)}$ implies
that

$$NF_{l(k+1)} = \max_{0 \le j \le m(k+1)} \{\|F_{k+1-j}\|\} \le \max\Bigl\{\max_{0 \le j \le m(k)} \{\|F_{k-j}\|\}, \|F_{k+1}\|\Bigr\} = \max\{NF_{l(k)}, \|F_{k+1}\|\} = NF_{l(k)}.$$

ii) $k < N$. In this case, we have $m(k) = k$. For any $k$,
$\|F_k\| \le \|F_0\|$, so $NF_{l(k)} = \|F_0\|$ for all such $k$.

These cases show that the sequence $\{NF_{l(k)}\}$ is decreasing. According
to assumption (H1) and $x_k \in L(x_0)$, one can see that the sequence
$\{NF_{l(k)}\}$ is convergent.


By Lemma 3 and since $f(x_k) = \tfrac{1}{2}\|F(x_k)\|^2$, we can conclude
that the sequence $\{f_{l(k)}\}$ is also decreasing and convergent.


3 Convergence theory 

In this section, we establish the global convergence and the q-quadratic
convergence rate of the proposed algorithm.


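Lemma 4. Suppose that $\{x_k\}$ is the sequence generated by Algorithm 1.
Then, we have

$$\lim_{k \to \infty} NF_{l(k)} = \lim_{k \to \infty} \|F(x_k)\|.$$

Proof. By Lemma 3.2 in [1] and $f(x_k) = \tfrac{1}{2}\|F(x_k)\|^2$, we have

$$\lim_{k \to \infty} f_{l(k)} = \lim_{k \to \infty} f(x_k).$$

This implies that

$$\lim_{k \to \infty} NF_{l(k)} = \lim_{k \to \infty} \|F(x_k)\|.$$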

In order to establish the global convergence of Algorithm 1, one needs to
establish the following results.


Lemma 5. Suppose that assumptions (H2) and (H3) hold, the sequence
$\{x_k\}$ is generated by Algorithm 1 and $d_k$ is a solution of the
subproblem (3). Then, we have

$$m_k(x_k) - m_k(x_k + d_k) \ge L_k \|F_k\|^2, \tag{14}$$

where $L_k := \beta M_0 \min\left\{c^{p_k}, \dfrac{M_0}{M_1^2}\right\}$.


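Proof. Using (H2) and (11), we have

$$\begin{aligned}
m_k(x_k) - m_k(x_k + d_k) &\ge \beta \|J_k^T F_k\| \min\left\{\Delta_k, \frac{\|J_k^T F_k\|}{\|J_k^T J_k\|}\right\} \\
&= \beta \|J_k^T F_k\| \min\left\{c^{p_k} NF_{l(k)}, \frac{\|J_k^T F_k\|}{\|J_k^T J_k\|}\right\} \\
&\ge \beta M_0 \|F_k\| \min\left\{c^{p_k} \|F_k\|, \frac{M_0 \|F_k\|}{M_1^2}\right\} \\
&= \beta M_0 \|F_k\|^2 \min\left\{c^{p_k}, \frac{M_0}{M_1^2}\right\} \\
&= L_k \|F_k\|^2,
\end{aligned}$$

where $L_k = \beta M_0 \min\left\{c^{p_k}, \dfrac{M_0}{M_1^2}\right\}$.
Therefore, the proof is complete.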


At this point, the global convergence of Algorithm 1 based on the men- 
tioned assumptions can be investigated. 


Theorem 6. Suppose that Assumptions (H1)-(H4) hold. Then, Algorithm 1
either stops at a stationary point of $f(x)$ or generates an infinite
sequence $\{x_k\}$ such that

$$\lim_{k \to \infty} \|F_k\| = 0. \tag{15}$$

Proof. By contradiction, suppose that there exist a constant
$\epsilon > 0$ and an infinite subset $K \subseteq \mathbb{N}$ satisfying

$$\|F_k\| > \epsilon, \quad \text{for all } k \in K. \tag{16}$$

Using (16), $\hat r_k \ge \mu$ and Lemma 5, we can conclude that

$$f_{l(k)} - f_{k+1} = f_{l(k)} - f(x_k + d_k) \ge \mu \bigl[m_k(x_k) - m_k(x_k + d_k)\bigr] \ge \mu L_k \|F_k\|^2 \ge \mu \epsilon^2 L_k.$$

The left-hand side of the above inequality tends to zero as $k$ goes to
infinity, and therefore $L_k$ tends to 0. This means that
$p_k \to \infty$, which contradicts Lemma 2. Therefore, the hypothesis (16)
is not true and the proof is complete.

To establish the quadratic convergence rate of the sequence generated by
Algorithm 1, an additional assumption is required (see [11, 14, 15, 40, 44]).


(H5) There exist constants $c_1 > 1$ and $\rho_1 \in (0,1)$ such that

$$c_1 \|x - x_*\| \le \|F(x)\| = \|F(x) - F(x_*)\|, \quad \forall x \in N(x_*, \rho_1),$$

where $x_*$ is a solution of (1) and
$N(x_*, \rho_1) := \{x \mid \|x - x_*\| \le \rho_1\}$.


Remark 2. By (H1) and (H4), the mapping $F(x)$ is continuously
differentiable and $J(x)$ is Lipschitz continuous. So, there exist two
constants $\gamma_L > 0$ and $\rho_2 \in (0,1)$ such that

$$\|F(x) - F(y) + J(x)(y - x)\| \le \gamma_L \|x - y\|^2, \quad \text{for all } x, y \in N(x_*, \rho_2).$$

For the purpose of our q-quadratic convergence, we simply choose
$\rho := \min\{\rho_1, \rho_2\}$.

Theorem 7. Suppose that Assumptions (H1)-(H5) hold and the sequence
$\{x_k\}$ generated by Algorithm 1 converges to $x_*$. Then, for $k$
sufficiently large, we have

$$x_{k+1} = x_k + d_k^0,$$

where $d_k^0$ is the solution of (3) corresponding to $p_k = 0$.
Furthermore, the sequence $\{x_k\}$ converges to $x_*$ q-quadratically.


Proof. Let $d_k^0$ be a solution of the subproblem (3) corresponding to
$p_k = 0$, so $d_k^0$ is a feasible point for (3). This along with Lemma 3
and Theorem 6 implies

$$\|d_k^0\| \le \Delta_k^0 = NF_{l(k)} \to 0, \quad \text{as } k \to \infty. \tag{17}$$

On the other hand, since $p_k = 0$ and $M_0 < 1 < M_1$, we obtain

$$L_k = \frac{\beta M_0^2}{M_1^2}. \tag{18}$$

Because Algorithm 1 has not stopped, we have $\|F_k\| > \epsilon$. This
fact together with Lemma 1, (14), (17) and (18) shows that

$$\begin{aligned}
\left|\frac{f_k - f(x_k + d_k^0)}{m_k(x_k) - m_k(x_k + d_k^0)} - 1\right|
&= \left|\frac{m_k(x_k + d_k^0) - f(x_k + d_k^0)}{m_k(x_k) - m_k(x_k + d_k^0)}\right| \\
&\le \frac{O(\|d_k^0\|^2)}{L_k \|F_k\|^2}
\le \frac{O((\Delta_k^0)^2)}{\dfrac{\beta M_0^2}{M_1^2}\,\epsilon^2} \to 0, \quad \text{as } k \to \infty.
\end{aligned}$$

This along with the fact $f_{l(k)} \ge f_k$ implies, for sufficiently large
$k$,

$$\hat r_k \ge \mu.$$

Thus, for all sufficiently large $k$, the trial step $d_k^0$ is accepted by
Algorithm 1, i.e., $x_{k+1} = x_k + d_k^0$.


At this point, the quadratic convergence of the sequence $\{x_k\}$
generated by Algorithm 1 is investigated. By (H2), there exists a constant
$M_1 > 0$ such that

$$\|J(x)\| \le M_1, \quad \text{for all } x \in \Omega. \tag{19}$$

Using (19) along with the mean value theorem, for all
$x_k \in N(x_*, \rho)$, we can easily see that

$$\|F_k\| = \|F_k - F_*\| = \|J(\xi)(x_k - x_*)\| \le M_1 \|x_k - x_*\|, \tag{20}$$

for some $\xi \in [x_k, x_*]$. As a result of this fact and Lemma 4, for any
sufficiently large $k$, it can be concluded that

$$NF_{l(k)} \le M_1 \|x_k - x_*\|,$$

and so

$$\|d_k^0\| \le NF_{l(k)} \le M_1 \|x_k - x_*\|. \tag{21}$$

From (H5), it is clear that

$$\|x_k - x_*\| \le \frac{1}{c_1}\|F_k\| \le \frac{1}{c_1} NF_{l(k)} \le NF_{l(k)} = \Delta_k^0.$$

This fact directly implies that $x_* - x_k$ is a feasible point for the
subproblem (3). Now, it follows from Remark 2 and (21) that

$$\begin{aligned}
\tfrac{1}{2}\|F_k + J_k d_k^0\|^2 = m_k(x_k + d_k^0) &\le m_k\bigl(x_k + (x_* - x_k)\bigr) \\
&= \tfrac{1}{2}\|F_k + J_k (x_* - x_k)\|^2 \\
&= \tfrac{1}{2}\|F_k - F_* + J_k (x_* - x_k)\|^2 \\
&\le \frac{\gamma_L^2}{2}\|x_k - x_*\|^4.
\end{aligned} \tag{22}$$

Also, (H5), (21) and (22) give us

$$\begin{aligned}
c_1 \|x_{k+1} - x_*\| \le \|F(x_{k+1})\| &= \|F(x_k + d_k^0)\| \\
&\le \|F_k + J_k d_k^0\| + O(\|d_k^0\|^2) \\
&\le \gamma_L \|x_k - x_*\|^2 + O(\|x_k - x_*\|^2)
\end{aligned}$$

for any sufficiently large $k$. So

$$\|x_{k+1} - x_*\| = O(\|x_k - x_*\|^2).$$

Hence, the sequence $\{x_k\}$ generated by Algorithm 1 is q-quadratically
convergent. Therefore, the proof is complete.


4 Numerical experiments 


In this section, we report numerical experiments obtained by running
Algorithm 1 (NATR) in comparison with the nonmonotone trust-region
algorithm (NTR), the adaptive trust-region algorithm of Zhang et al. [44]
(ATRZ), its nonmonotone version (NATRZ), the adaptive trust-region
algorithm of Fan and Pan [15] (ATRF) and its nonmonotone version (NATRF) on
a set of nonlinear systems of equations with dimensions from 100 to 504,
selected from a wide range of the literature. Problems 1-36 are chosen from
Cruz et al. [25] and problems 37-42 are chosen from Lukšan and Vlček [28].
For all of these codes, the trust-region subproblems are solved by the
Steihaug-Toint procedure, see [8]. The Steihaug-Toint algorithm terminates
at $x_k + d$ when


[Vinge + All < min f Vinge +A pVimatee + ah 
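For readers unfamiliar with the Steihaug-Toint procedure, the following is a minimal sketch of truncated conjugate gradients applied to the model (3). It is not the code used in the experiments: the function names are hypothetical, and the absolute tolerance `tol` simply stands in for the right-hand side of the termination test above.

```python
import numpy as np

def boundary_tau(d, p, delta):
    """Positive tau with ||d + tau * p|| = delta."""
    a, b, c = p @ p, 2.0 * (d @ p), d @ d - delta ** 2
    return (-b + np.sqrt(b * b - 4.0 * a * c)) / (2.0 * a)

def steihaug_toint(Fk, Jk, delta, tol, max_iter=None):
    """Truncated CG for min 0.5*||F_k + J_k d||^2 subject to ||d|| <= delta."""
    g = Jk.T @ Fk
    d = np.zeros_like(g)
    r = g.copy()                     # gradient of the model at x_k + d
    p = -r
    for _ in range(max_iter or 2 * g.size):
        if np.linalg.norm(r) <= tol:
            break
        Jp = Jk @ p
        curv = Jp @ Jp               # p^T J_k^T J_k p, never negative
        if curv <= 0.0:
            return d + boundary_tau(d, p, delta) * p
        alpha = (r @ r) / curv
        if np.linalg.norm(d + alpha * p) >= delta:
            # The CG iterate leaves the region: stop on the boundary.
            return d + boundary_tau(d, p, delta) * p
        d = d + alpha * p
        r_new = r + alpha * (Jk.T @ Jp)
        p = -r_new + ((r_new @ r_new) / (r @ r)) * p
        r = r_new
    return d

# Made-up data for illustration.
Jk = np.array([[2.0, 1.0], [1.0, -1.0]])
Fk = np.array([0.25, 0.5])
d_full = steihaug_toint(Fk, Jk, delta=10.0, tol=1e-12)   # interior solution
d_small = steihaug_toint(Fk, Jk, delta=0.1, tol=1e-12)   # stops on the boundary
```

With a large radius the procedure returns the Gauss-Newton step, while with a small radius it stops on the trust-region boundary after the first conjugate gradient direction.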


The Jacobian matrix $J_k$ can be either evaluated analytically by a
user-supplied function or approximated by a finite-difference formula
provided by the code. Since exact computation may not be practical for
large-scale problems, similarly to [5], we used the following
finite-difference formula to approximate the Jacobian matrix $J_k$:

$$[J_k]_{\cdot j} \approx \frac{1}{h_j}\bigl(F(x_k + h_j e_j) - F_k\bigr),$$

where $[J_k]_{\cdot j}$ denotes the $j$-th column of $J_k$, $e_j$ is the
$j$-th vector of the canonical basis and

$$h_j := \begin{cases} \sqrt{\epsilon_m} & \text{if } x_{k_j} = 0, \\ \sqrt{\epsilon_m}\,\mathrm{sign}(x_{k_j}) \max\left\{|x_{k_j}|, \dfrac{\|x_k\|_1}{n}\right\} & \text{otherwise.} \end{cases}$$
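A forward-difference Jacobian of this kind can be sketched as follows. The experiments in the paper were run in MATLAB, so this Python version is only illustrative; it assumes that $\epsilon_m$ is the double-precision machine epsilon and that the scaling term is $\|x_k\|_1 / n$, and the function name `fd_jacobian` and the test system are hypothetical.

```python
import numpy as np

def fd_jacobian(F, x, Fx=None):
    """Column-by-column forward-difference approximation of the Jacobian."""
    x = np.asarray(x, dtype=float)
    Fx = F(x) if Fx is None else Fx
    n = x.size
    sqrt_eps = np.sqrt(np.finfo(float).eps)   # assumption: eps_m = machine epsilon
    scale = np.linalg.norm(x, 1) / n          # assumption: ||x||_1 / n
    jac = np.empty((Fx.size, n))
    for j in range(n):
        if x[j] == 0.0:
            h = sqrt_eps
        else:
            h = sqrt_eps * np.sign(x[j]) * max(abs(x[j]), scale)
        e = np.zeros(n)
        e[j] = h
        jac[:, j] = (F(x + e) - Fx) / h       # j-th column
    return jac

# Hypothetical test system with a known Jacobian.
F = lambda x: np.array([x[0] ** 2 + x[1] ** 2 - 1.0, x[0] - x[1]])
x = np.array([1.0, 0.0])
J_fd = fd_jacobian(F, x)
J_exact = np.array([[2.0, 0.0], [1.0, -1.0]])
```

Each call costs $n$ extra evaluations of $F$, which is why the number of function evaluations is reported separately from the number of iterates.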


All codes are written in MATLAB 9 programming environment with double 
precision format in the same subroutine. In our numerical experiments, the 
algorithms were stopped whenever 


\|Fell < 107°, 


or when the total number of iterates exceeded 1000. During the
implementation, we checked that all codes converged to the same point, and
we report data only for problems on which all algorithms converged to an
identical point; less than 1 percent of the problems were excluded for this
reason.


Figure 1: Iterates performance profile for the presented algorithms


While NATR algorithm takes advantages of the parameters pp = 107°, 
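c=0.5. The NTR algorithm employs the parameters $\mu_1 = 0.1$,
$\mu_2 = 0.9$ and updates the trust-region radius as in [8] by the
following formula

$$\Delta_{k+1} := \begin{cases} c_1 \|d_k\| & \text{if } r_k < \mu_1, \\ \Delta_k & \text{if } \mu_1 \le r_k < \mu_2, \\ c_2 \Delta_k & \text{if } r_k \ge \mu_2, \end{cases}$$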


where $c_1 = 0.25$ and $c_2 = 0.3$. Following [34], we use $\Delta_0 = 1$
as the initial trust-region radius for NTR. The parameters of ATRZ and ATRF
are chosen as in [44] and [15], respectively. Table 1 lists the names and
dimensions of the test problems considered. Figures 1 and 2 give the
performance profiles of all algorithms, using the finite-difference
Jacobian matrix, for the total number of iterations and the total number of
function evaluations, respectively. A performance profile gives, for every
$\tau \ge 1$, the fraction of problems for which the algorithm is within a
factor $\tau$ of the best [10].
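The definition of a performance profile can be made concrete with a short sketch. The cost matrix below is made up for illustration and is not the paper's data; the function name is hypothetical, and the code assumes that at least one solver succeeds on every problem.

```python
import numpy as np

def performance_profile(costs, taus):
    """Rows of the result correspond to taus, columns to solvers.

    costs[p, s] is the cost (e.g. iterations) of solver s on problem p,
    with np.inf marking a failure.
    """
    costs = np.asarray(costs, dtype=float)
    best = costs.min(axis=1, keepdims=True)   # best cost per problem
    ratios = costs / best                     # performance ratios
    return np.array([(ratios <= tau).mean(axis=0) for tau in taus])

# Made-up costs: 3 problems, 2 solvers; solver 1 fails on the last problem.
costs = [[10.0, 12.0],
         [20.0, 10.0],
         [5.0, np.inf]]
profile = performance_profile(costs, taus=[1.0, 2.0])
```

The value at $\tau = 1$ is the fraction of problems on which a solver is the best (its share of wins), and the limit for large $\tau$ is the fraction of problems it solves at all.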

Figure 1 clearly indicates that NATR outperforms NTR, ATRZ, ATRF, NATRZ
and NATRF regarding the total number of iterates. In particular, NATR has
the most wins, in nearly 81% of the test problems. Meanwhile, in terms of
the ability to complete a run successfully, it is the best among the
considered algorithms because its profile grows faster than the others and
reaches 1 more rapidly. Moreover, as illustrated in Figure 2, NATR performs
remarkably better than the others regarding the total number of function
evaluations, where it has the most wins in almost 77% of the tests.
Furthermore, Figures 1 and 2 show similar patterns in terms of the ability
to complete a run successfully. This implies that the total number of
trust-region subproblems solved is notably decreased by using the NATR
algorithm. These results indicate that the proposed algorithm is an
efficient and robust approach for solving systems of nonlinear equations.


114 K. Amini, H. Esmaeili and M. Kimiaei


Figure 2: Function evaluations performance profile for the presented
algorithms


5 Concluding remarks 


It is well known that trust-region methods for solving systems of
nonlinear equations have remarkable numerical reliability as well as strong
theoretical convergence properties. Practical experience with the
trust-region framework indicates that applying nonmonotone adaptive
techniques for determining the trust-region radius reduces the number of
subproblems solved, and that employing nonmonotone strategies increases the
efficiency and robustness of the algorithm. In this paper, by exploiting an
effective adaptive trust-region radius based on a reliable nonmonotone
strategy, a new nonmonotone trust-region algorithm is introduced for
solving systems of nonlinear equations. These modifications of the
traditional trust-region procedure preserve its theoretical guarantees: the
global and q-quadratic convergence properties of the proposed algorithm are
established. Numerical results on a set of nonlinear systems indicate that
the number of iterates and the number of function evaluations are so close
to each other that, by applying the proposed algorithm, significant savings
in computational cost can be obtained.


Table 1: List of test functions

Problem name                  Dim    Problem name                   Dim
Exponential 1                 500    Geometric                      100
Exponential 2                 500    Function 27                    500
Extended Rosenbrock           500    Tridimensional valley          501
Chandrasekhar’s H-equation    500    Complementary                  500
Singular                      500    Hanbook                        500
Logarithmic                   500    Tridiagonal system             500
Broyden tridiagonal           500    Five-diagonal system           500
Trigexp                       500    Seven-diagonal system          504
Variable band 1               500    Extended Cragg and Levy        500
Variable band 2               500    Extended Wood                  500
Function 15                   500    Tridiagonal exponential        500
Strictly convex 1             500    Brent                          500
Strictly convex 2             500    Troesch                        500
Function 18                   501    Broyden banded                 500
Zero Jacobian                 500    Discrete integral equation     500
Geometric programming         100    Countercurrent reactors 1      504
Function 21                   501    Singular Broyden               500
Linear function-full rank 1   500    Structured Jacobian            500
Linear function-full rank 2   500    Extended Powell singular       500
Brown almost linear           500    Generalized Broyden banded     500
Variable dimensioned          500    Extended Powell badly scaled   500